Speaker Identification in Emotional Environments
نویسنده
چکیده
The performance of speaker identification is almost perfect in the neutral environment. However, the performance is significantly deteriorated in emotional environments. In this work, three different and separate models have been used, tested and compared to identify speakers in each of the neutral and emotional environments (completely two separate environments). Our emotional environments in this work consist of five emotions. These emotions are: angry, sad, happy, disgust and fear. The three models are: Hidden Markov Models (HMMs), Second-Order Circular Hidden Markov Models (CHMM2s) and Suprasegmental Hidden Markov Models (SPHMMs). Our results show that the three models perform extremely well for speaker identification in the neutral environment. In emotional environments, the average speaker identification performance based on HMMs, CHMM2s and SPHMMs is 61.4%, 66.4% and 69.1%, respectively. Our results in this work are better than those obtained in subjective evaluation by human judges.
منابع مشابه
Employing both gender and emotion cues to enhance speaker identification performance in emotional talking environments
Speaker recognition performance in emotional talking environments is not as high as it is in neutral talking environments. This work focuses on proposing, implementing, and evaluating a new approach to enhance the performance in emotional talking environments. The new proposed approach is based on identifying the unknown speaker using both his/her gender and emotion cues. Both Hidden Markov Mod...
متن کاملSpeaker identification investigation and analysis in unbiased and biased emotional talking environments
This work aims at investigating and analyzing speaker identification in each unbiased and biased emotional talking environments based on a classifier called Suprasegmental Hidden Markov Models (SPHMMs). The first talking environment is unbiased towards any emotion, while the second talking environment is biased towards different emotions. Each of these talking environments is made up of six dis...
متن کاملEmploying Emotion Cues to Verify Speakers in Emotional Talking Environments
Usually, people talk neutrally in environments where there are no abnormal talking conditions such as stress and emotion. Other emotional conditions that might affect people talking tone like happiness, anger, and sadness. Such emotions are directly affected by the patient health status. In neutral talking environments, speakers can be easily verified, however, in emotional talking environments...
متن کاملSpeaker Recognition in an Emotional Environment
The goal of this paper is to assess the effect of emotional state of a speaker when text-independent speaker identification is performed. Mel-frequency cepstral coefficients are the features of the speech signal used for speaker recognition. For training the speaker models and testing the system, Support Vector Machines are employed. Berlin emotional speech database, which contains 10 different...
متن کاملEEMD-Based Speaker Automatic Emotional Recognition in Chinese Mandarin
Emotion feature extraction is the key to speech emotional recognition. And ensemble empirical mode decomposition(EEMD) is a newly developed method aimed at eliminating emotion mode mixing present in the original empirical mode decomposition(EMD). To evaluate the performance of this new method, this paper investigates the effect of a parameters pertinent to EEMD: speech emotional envelope. First...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010